Using Suprasegmentals in Training Hidden Markov Models For Arabic

نویسنده

  • Ossama Essa
چکیده

Automatic speech segmentation is an essential tool for building large corpora for training continuous speech recognition systems. Manual segmentation of speech is both time consuming and an error-prone task. Several automatic segmentation systems have been proposed based on the acoustical features of the speech 5] 11]. In this paper, we present a novel technique for automatic seg-mentation of Arabic speech in which both prosodic and acoustical features of the speech are examined to achieve a higher accuracy of segmentation. The system was used to automatically label 1012 utterances of Koranic Arabic. These utterances were then used to train discrete density Hiddem Markov Models (HMM). The resulting models were test on 105 manually segmented utterances. Koranic Arabic speech is the rhythmic speech used in reciting the Koran and is considered the standards for Modern Standard Arabic (MSA) by most Arabic linguists. We show that incorporating the prosodic features in the design resulted in better segmentation accuracy.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Introducing Busy Customer Portfolio Using Hidden Markov Model

Due to the effective role of Markov models in customer relationship management (CRM), there is a lack of comprehensive literature review which contains all related literatures. In this paper the focus is on academic databases to find all the articles that had been published in 2011 and earlier. One hundred articles were identified and reviewed to find direct relevance for applying Markov models...

متن کامل

Intrusion Detection Using Evolutionary Hidden Markov Model

Intrusion detection systems are responsible for diagnosing and detecting any unauthorized use of the system, exploitation or destruction, which is able to prevent cyber-attacks using the network package analysis. one of the major challenges in the use of these tools is lack of educational patterns of attacks on the part of the engine analysis; engine failure that caused the complete training,  ...

متن کامل

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007